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Monotone properties and thresholds 

We use 2^ for the collection of subsets of a finite set X. Given X of size n and p S [0, 1], //p is 

' the product measure on 2^ given by fJ.p{S) = — p)"'~\^^. Recall that ^ C 2'''- is often called a 

O ■ property of subsets of X, and is said to be monotone (increasing) HB^AgJ^^EgJ^. For a 

^ : monotone T C 2^ , denote by pdJ") that p € [0, 1] for which ^piJ") (= X;{/^p('S') : S e T}) = 1/2. 

I We will assume throughout this note that ^ is a monotone property, and will always exclude the 

0^- trivial cases = 2^ and J" = 0, so in particular Pc{J~) exists and is unique. 

, Recall that for monotone Tn C 2^" (n = 1,2,...), a function po{n) is a threshold for the 

^ I sequence {Tn} if 

; ' „ fT\^l^ if p(n)/po(n) ^ 

Q . /^P(n)i-^nj ^ I 1 if p(n)/po(n) ^ oo. 

^ I (See e.g. [2], [6] or [18]. Of course Po{n) is not quite unique, but following common practice we will 

' often say "the" threshold when we should really say "a.") It follows from [8] that every {J^n} has 

d • a threshold, and that in fact (see e.g. [18, Proposition 1.23 and Theorem 1.24]) po{n) := Pc{^n) 

^ i is a threshold for {J-n\. So the quantity pc conveniently captures threshold behavior, in particular 
allowing us to dispense with sequences. 

(N ; . 

J> ' Conjecture 1 For any (nontrivial monotone) C 2 there exist C 2 and q G [0, 1] such that 

00 : 

T— I . (a) each B ^ contains some member of Q , 

Cn ; (b) EAee 9'^' < 1/2, and 

o . 

\Q . (c) q> pc{T)/{K log n), 

o ■ 

^: 

^ ' Note that, for a given g, existence of a ^ for which (a) and (b) hold trivially gives Hq{J^) < 1/2; so 

' the conjecture says that for any as above there is a trivial lower bound on Pc{^) which is within 

^ ! a factor O(logn) of the truth. To put this another way, define q{J-) to be the supremum of those 

g's for which there exists Q satisfying (a) and (b). Then Pc{^) > q{^) is trivial, and Conjecture 1 
would say Pc{^) < Kq{J^)\ogn. It is easy to see (familiar examples will be recalled below) that 
5^ , the factor logn cannot be improved. 

(Condition (b) is also considered by Talagrand in [30] and [31]. In particular a hope of [30] 
(which is also primarily a problem paper) is to say that in certain geometrically-motivated situations 
the threshold essentially coincides with its trivial lower bound (more precisely, the gap is 0(1)). 
This is different from what we are asking, but it may be that the underlying difficulties are similar.) 

In the rest of this note we will briefly mention a few consequences of, and questions related to. 
Conjecture 1. As some of these indicate, the conjecture is extremely strong. It would probably be 
more sensible to conjecture that it is not true; but it does not seem easy to disprove, and we think 
a counterexample would also be quite interesting. 
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where K is a universal constant and n = \X\. 
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Graph properties 

A basic result for random graphs states that a threshold for the (usual) random graph Gn,p to 
contain a given (fixed) subgraph H is n~^/^'^^\ where m{H) = max{\E{H')\/\V{H')\ : H' C H}. 
(This was proved for "balanced" graphs in [12], and for general graphs in [4].) Equivalently, one 
may take as a threshold the least p = p{n) such that for each H' C H the expected number of 
copies of H' in Gn,p is at least 1. 

This last p{n) — call it the "expectation threshold" — still makes sense if, instead of a fixed H, 
we consider a sequence {Hn}, where, allowing isolated vertices, we assume |y(ff„)| = n. For- 
mally, for an arbitrary H, define Pe{H) to be the least p such that, for every spanning H' C H, 
{\V{H')\\/\Aut{H')\)p\^^^'^^ > 1. Then the p{n) of the preceding paragraph is the same as pE{Hn), 
where is gotten from H by adding n — |F(i7)| isolated vertices. 

Of course in this more general situation, p^ may no longer capture the true threshold behavior. 
Well-known examples are perfect matchings and Hamiltonian cycles in Gn,p, for each of which 
the expectation threshold is (easily seen to be) on the order of 1/n, while the actual threshold is 
logn/n. (This was proved for matchings in [12], and for Hamiltonian cycles in [27] and [24]. It's 
easy to see that logn/n is a lower bound, since existence of a perfect matching and Hamiltonicity 
require minimum degree at least 1 and 2 respectively. The beautiful "stopping time" versions of 
these results ([12] for matchings, [23] and [5] for Hamiltonian cycles) say, in a precise way, that in 
each case failure to satisfy the degree requirements is the main source of failure to have the desired 
property.) 

Now, for an n-vertex H, write Pc{H) and q{H) for Pc{^) and q{J'), where C 2^ 2 ) is the 
collection of graphs on [n] that contain copies of H. So again, pc{Hn) is a threshold function for 
"Gn,p 2 Hn" Moreover, for any H, q{H) > pe{H)/2 (the 1/2 should of course be ignored). To 
see this, notice that we have an alternate definition of pe^H) analogous to that of q{H): it is the 
supremum of those p's for which there exists H' C H so that, with Q the collection of copies of H', 
J2{p^'^^ : ^ G ^} < 1. So the following conjecture, which was actually our starting point, contains 
Conjecture 1 for J^'s of this (subgraph containment) type. 

Conjecture 2 For any H, Pc{H) < KpE{H)\og\V {H)\, where K is a universal constant. 

On the other hand, it could be that Conjecture 1 implies Conjecture 2, as would follow from a 
positive answer to 

Question 3 Is it true that, for some fixed K, q{H) < Kpe{H) for every H? 

In other words, is it true that in the present situation the value of q does not change significantly 
if we require Q (in the definition of q) to consist of all copies of a single graph? Pushing our luck, 
we could extend this to a general graph property (meaning, as usual, that membership in 
depends only on isomorphism type): can we in this case require that Q also be a graph property 
(without significantly affecting q)7 In fact, we don't even know that the answer to the following is 
negative. 

Question 4 For C. 2^ let q*{T) be the supremum of those q's for which there is some Aut(J^)- 
invariant Q satisfying (a) and (h) of Conjecture 1. Is it true that q{T) < Kq*{J^) for some universal 
K? 

Returning briefly to subgraph containment, it is not hard to see that for an n-vertex tree T with 
maximum degree A(r) = A, both Pe{T) and q{T) are always between Q{l/n) and 0(A/n), and 
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are B(A/n) if A = 17 (log n). In particular, either of the above conjectures would imply that for 
any fixed A wc have pdT) < O (log n/n) for any T of maximum degree at most A. For unbounded 
degrees, the conjectures are not as precise, but as an aside we may mention the natural guess that 
there are constants Ki > and K2 such that for any tree T (with n vertices) 

i^i max{log n/n, A (T)/n} < Pc{T) < niax{log n/n, A(r)/n}. (2) 

Finally, we don't even know a counterexample to 

Conjecture 5 Given e > there is a K such that any H with £ < Pc{H) < 1 — e satisfies 
Pc{H) < Kpe{H). 

Hypergraph matching 

Write TLk{n,p) for the random /c-uniform hypergraph on vertex set [n] in which each /c-set is 
an edge with probability p, independent of other choices. A celebrated question first raised and 
studied by Schmidt and Shamir [29, 11] asks: for fixed k and n ranging over multiples of k, what 
is the threshold for Hk{n,p) to contain a perfect matching (meaning, of course, a collection of 
edges partitioning the vertex set)? The best published progress to date, building in part on [16] 
and some antecedents, is due to J.H. Kim [22], who shows that the threshold is at most 0(n~"'=) 
with ctfc = — 1 — 1/(5 + 2{k — 1)); and very recently A. Johansson [19] announced that he can 
improve this to 0(n~'^+^+°^^)). But it is natural to expect that, as for Gn,p, one usually has a 
perfect matching as soon as there are no isolated vertices, which would say in particular that (this 
was perhaps first proposed explicitly in [10]) the true threshold is n"'^^-'^ log n. In fact this would 
follow from Conjecture 1, since it's easy to see that for the collection of fc- uniform hypergraphs 
on [n] containing perfect matchings (a property of subsets of X = ('^^)), one has q{J^) = Q{n~'^'^^). 

A related problem asks for the threshold for Gn,p to contain a "triangle factor" (collection of 
triangles partitioning the vertex set). Here again one expects that isolated vertices (i.e. vertices not 
contained in triangles) are the main obstruction, which would make the threshold n~^/^ log^^'^ n. 
In this case Conjecture 1 is a little off, saying only that the threshold is at most n~^/^ log n; but 
this is still much better than what's presently known. (Here again the best published bound is 
from [22] (n"^/^"*"^/^^, improving on [25]), and n"^/^"'""^^^ has been announced in [19].) 

Isoperimetry 

Wc would like to suggest a possible, if speculative, connection between the preceding questions 
and isoperimetric behavior. The most obvious difficulty posed by Conjecture 1 is that of identifying 
the requisite Q. While such an identification based purely on combinatorial information seems 
difficult, a beautiful result of Friedgut [13] does achieve something of the sort using the Fourier 
transform (see the comments following Conjecture 6). This somewhat motivates the approach 
proposed here. 

Given a monotone T C 2^ , let m{p) = l^p{J^)- The derivative m'{p) is, according to a fun- 
damental observation of Russo ([28] or e.g. [17]), equal to the "total influence" of w.r.t. fip, 
namely 

n 

I = ip{j^) = Y,i^p{\^^{s,si^{i}}\ = 1). 

i=l 

This measure of the edge boundary of J- has recently been of importance in a number of contexts; 
see e.g. [20], [13], [21]. (The combination of Russo's lemma and lower bounds on influence has been 
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a powerful tool for proving sharp threshold behavior. A few recent results have also used such 

sharp threshold information as a tool in estimating location of thresholds; see e.g. [1],[7].) 
In the present context the edge isoperimetric inequality takes the form 

P ■ Ip{^) > m{p) logp m{p). (3) 

Equality holds whenever is a subcube, {S C X : S D R} for some R. Though one would think 
(3) would be known, we could not find it in the literature, so indicate a proof at the end of this 
note. A derivation, based on log-Sobolev inequalities, of something containing a slightly weaker 
version of (3) (in which the right hand side is multiplied by a constant Cp) is given in [26, p.lll, 
(5.28)]. 

For ^ C 2^, write (Q) for the monotone property generated by Q; that is, (G) = {B C X : G 

G,AC1 B}. (So, for example, (a) of Conjecture 1 is "J- C (G).") We would like to say, very roughly, 
that if (3) is fairly tight for a pair {T,p), then is close (in fj,p) to some property generated by 
small sets. We will say that is {C,p)- optimal if (3) is tight to within the factor C; that is, 

p ■ m'{p) < Cm{p) logp m{p). 

Thus subcubes are (l,p)-optimal for every p. 

Conjecture 6 Given C, there are K,5 > Q such that for any {Clog(l/p),p)- optimal T (and 

(a) There is an R C X of size at most K log{l / m{p)) , such that 

l^p{S G J^\S ^R)>{1 + S)m{p). 

(b) There is a family G C {A C X : \A\ < K\og{l/m{p))} such that Hp{J^A{G)) < o{m{p)). 
Furthermore, there is some fixed C > such that for any {Clog{l/p),p)- optimal T , 

(c) there is G C 2^ such that JF C (G) and E{p''^' : S eG} <l/2. 

For iJip{J^) bounded away from and 1, (b) is a conjecture of Friedgut [13], which he proved in 
case is a graph property, and (a) is similar to a theorem of Bourgain [9]. The extra log(l/p) in 
Conjecture 6 seems a little unnatural, but makes sense in light of whatever examples we've looked 
at (see in particular the prototypical "dual tribes" example below). There is also some vague 
feeling that in "cases of interest" one can multiply the right hand side of (3) by r2(log(l/p)). (Of 
course Conjecture 6 may be regarded as a statement to this effect. Also, for instance, the extra 
log is not needed if ^ip{J^) is bounded away from and 1, since (3) applied to the dual family, 
J^* = {A C X : X \ A JF}, gives (in general) 

(1 -p)I > (1 - iXp{J^))\og^_p{l - ^ip{J^)) 

(note Hi-p{T*) = 1 - Mp(^) and h-p{r*) = Ip{r)).) 

It is not hard to see (a proof is included below) that for every e > there is a C such that for 
every monotone (on a set of size n) 

3 p G [n~^Pc{^),Pc{^)\ for which T is (Clog(l/p),p)-optimal. (4) 

We don't know whether can be improved to 0(1/ log n); in fact, as far as we know even the 
following is possible. 
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Conjecture 7 For each C > there is an e > such that for any T 

3 p G [£pc(-^)/logn,Pc(-^)] for which !F is {Clog{l/p),p)- optimal. 

Of course this combined with Conjecture 6(c) would give Conjecture 1. (Considering tliat fipiJ-) 
depends only on the sequence (ao,...,an)) where = ak{J-) ■= \{A : \A\ = k}\, and that 

possibilities for this sequence are completely determined by the Kruskal-Katona Theorem, it's a 
little odd that Conjecture 7 should not be easy to settle one way or the other.) 

Duality. 

We would like to say — but don't know how — that even the logn "gap" of Conjecture 1 is 
not typical and, more particularly, implies some kind of simplicity in the dual. Could it be, for 
instance, that if p = pdJ') is bounded away from and 1 (and the logn gap is tight), then there 
is some R C [n] of size O(logn) for which ^i^p{S G J-*\S ^ R) > 1/2 + $1(1) (equivalently, 
lJip{S & J^*\Sr\R = $) < 1/2 — ^2(1))? And might this even extend to more general situations if we 
replace O(logn) by 0(p~^logn)? A similar conclusion — existence of R of size O(logn) for which 
either ^p{T\S ^ i?) « 1 or ^p{F\S (1 R = i/}) ^ — is conjectured in [14, Conjecture 2.6], in case 
both Pc{T) and fip^{T) are bounded away from and 1, and the isoperimetric inequality of [20] is 
tight to within a constant factor. 

Slightly related thoughts about the interplay of and J^* may be found in [15]; in particular, 
a variant of Conjecture 4.1 of that paper would imply that for any graph property T a weaker 
version of our Conjecture 1 — in which one only asks that (t?) contain most of T — holds for either 
.F or .F*. In fact [15] — which deals with randomized decision tree complexity, especially of graph 
properties — partly motivated our Conjecture 2; but it is not clear how far this connection goes. 
(For one thing, the aforementioned Conjecture 4.1, and the complexity conjecture that motivated 
it, are not correct for general properties.) 

A little example 

The following example (the dual of the "tribes" example of Ben-Or and Linial [3]) may help to 
put some of the preceding discussion in perspective. Several of the above conjectures are based (to 
the extent they're based on anything) on the idea that this simple construction is about as bad as 
things get. 

Suppose k\n, and let X be a set of size n, and Xi U • • • U Xm a partition of X, with m = n/k 
and \Xi\ = k for each i. Let J- consist of all sets meeting each Xj. Then m{p) (= iip{J-)) 
= (1 - {l-pfyl^ and m'{p) = n(l - {1 - pfYl^-^{l ~ pf-^ . 

If we take k = logn — log logn, then Pc{^) is bounded away from and 1, while the conclu- 
sion of Conjecture 1 requires q < 0(1/ logn) (and we may then take Q = J-). Here we achieve 
(0(log(l/p)),p)-optimality when p = ^(1/logn), while (0(l),p)- and (1 — o(l),p)-optimality re- 
quire, respectively, p = log~^^^^^^^^ n, and p = log"*^^^^ n. 

Little proofs 

Proof of (3). We actTially show this for general (i.e. not necessarily monotone) J-. Note / makes 
sense in this generality, though its interpretation as a derivative is only valid when !F is monotone. 

We proceed by induction on n. Write /x' (= /x^) for influence and measure in {0, 1}"~^. Let 
A = {x & T : Xn = 0}, B = {x ^ T : Xn = By induction, 

pi = p(l -p)l'{A) +p^l'{B) +p^^\AAB) > (1 -p)^^'iA) log^ ^^\A) +Pti'{B) log^ /x'(-B) +p^l'{Al^B), 
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and we would like to say that the r.h.s. is at least 



((1 -p)fi'iA) +pfi'{B))logp{{l-p)fi'iA) +Pfi'{B)). 
Set a = n'iA), P = n'{B). Then assuming (w.l.o.g.) that P > a, it's enough to verify the numerical 



statement 



(1 - p)a logp a + p(3 logp /3 + p{(3 - a) - {{1 - p)a + pp) logp((l - p)a + pf3) > 0. 

But this holds with equality when (3 = a, and an easy calculation shows that the derivative of the 
l.h.s. w.r.t. (3 is nonnegative (for all (3, though we only need (3 >a). 



Proof of (4). Wc prove this with C ~ l/(eln2) (with the log in (4) interpreted as log2). Set 
Pc{^) = Pc- Suppose is not (Clog(l/p),p)-optimal for any p £ [n~^Pc,Pc]', that is, 

pm'{p) > Cin{p)\og{l/m{p)) 

for each such p. We show this implies that m{n~^Pc) < a contradiction since = J2{l^p{^)\{y ~ 
X : y ^ J-}\ : x G J-} < nm(p) gives (log(l/p),p)-optimality whenever m{p) < 2~"'. 

Notice that if m{p) < 2"^ and tm'{t) > Cm(t) log(l/m(t)) for t G [(1 - l/{rC))p,p], then 
m{p{l — l/(rC))) < 2"^''"'""^^ Thus our assumption gives m{p) < 2"" for 

p = pc{l - 1/C){1 - \/{2C)) ••■(!- l/((n - 1)C)) « n-'pc. 
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and [31]. 
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